Competitive cluster growth in complex networks 
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Understanding the process by wliicli tlie individuals of a society make up their minds and reach 
opinions about different issues can be of fundamental importance. In this work we propose an 
idealized model for competitive cluster growth in complex networks. Each cluster can be thought 
as a fraction of a community that shares some common opinion. Our results show that the cluster 
size distribution depends on the particular choice for the topology of the network of contacts among 
the agents. As an application, we show that the cluster size distributions obtained when the growth 
process is performed on hierarchical networks, e.g., the Apollonian network, have a scaling form 
similar to what has been observed for the distribution of number of votes in an electoral process. 
We suggest that this similarity is due to the fact that social networks involved in the electoral 
process may also posses an underlining hierarchical structure. 
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The sedimentation of new trends and ideas in large so- 
cial communities can have a profound impact in the life of 
individuals. An instance where the dynamics of opinion 
formation may be o major importance is the democratic 
elections of representatives. In addition, in the electoral 
process every agent is called to give her/his opinion in 
an anonymous way and the statistical results are easily 
accessible 0. This makes elections ideal systems to re- 
searchers interested in studying the process of opinion 
formation. Since, typically, an individual is more likely 
to listen to someone they have a personal contact, the 
process may also be driven by a mouth-to-mouth inter- 
action besides from massive political campaigns based on 
media programs. In this way, the spreading of an opinion 
follows apattern similar to the spreading of an epidemic 
process jj]- One should also expect that the particular 
structure of the network of contacts among the agents of 
a society may have an impact in the way the opinions 
propagate. 

The intricate structure of interactions of many nat- 
ural and social systems has been object of intense re- 
search in the new area of complex networks. Most of 
the effort in this area has been directed to find the topo- 
logical properties of real world networks 0, 0, IS IE 
and understand the effects that these properties cast on 
dynamicalprocesses taking place on these complex net- 
works |M I^^ITollTH. For instance, the small- world char- 
acteristic where each node of the network is only a 
few connections apart from any other, permits a quick 
spreading of information through the network, being fun- 
damental in processes of global coordination ^12^ and 
feedback regulation [T^ . 

Another property commonly studied in complex net- 
works is the degree distribution P{k), that gives the prob- 
ability with which an arbitrary node is connected to ex- 
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actly k other nodes. One relevant characteristic often 
observed in complex networks is a scale- free degree distri- 
bution 4], namely, a distribution which follows a power- 
law, P{k) ^ k^^ , with an exponent typically in the range 
2 < 7 < 3. Such broad degree distribution has a dramatic 
effect in many dynamical processes. In the spreading of 
infectious diseases, for example, it has been shown that 
when the infection is mediated by a scale-free network, 
any infection rate above zero results in a positive fraction 
of infected individuals T?] . It was recently suggested that 
another universal characteristic of real-world networks is 
a structure of communities, where smaller communities 
in the network are joined to larger communities by highly 
connected nodes that play the role of local hubs . This 
structure may be related to the self-similar characteristic 
observed in some complex- networks [T^. 

In this letter we investigate a dynamic process of com- 
petitive cluster growth in complex networks. In this pro- 
cess many alternative and self-excluding states are acces- 
sible to the nodes of the network. We say that nodes in 
the same state belong to a cluster, and each of these clus- 
ters competes with the others to reach a larger part of 
the network. This idealized mechanism can be thought 
as a model for a variety of different processes that take 
place in real networks. For example, one can think of 
each cluster as the part of a population that has been 
infected with a certain strain of a virus. Alternatively, 
the clusters may represent alternative opinions in a social 
group. We study the distribution of the fraction of the 
network occupied by an arbitrary cluster. Our results 
show that the network topology has great influence over 
the behavior of the cluster size distributions. 

Our model for competitive cluster growth is described 
as follows: the process is mediated by a substrate net- 
work with N nodes; in the first moment, a number Ug of 
nodes is chosen at random to be the seeds of the spread- 
ing process, with the density of seeds being Ug/N . Each 
seed will be the first node of a cluster. Then, the clusters 
grow by incorporating nodes that are neighbors of these 
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FIG. 1: Pictorial description of our model for a competitive 
cluster growth process. To make a clear picture of our model, 
we used as a substrate network for the growth a third gen- 
eration Apollonian network \T^ . The matching graph form 
of this network makes it particularly suitable for a plane rep- 
resentation. In the first step of the growth model (a), a few 
nodes (the large circle, square, triangle, and star) are chosen 
to be the seeds of the growing clusters. Each of the seeds is 
the first node of a different cluster. All the remaining nodes 
(small dots) do not assigned to any of the clusters in the be- 
ginning, this nodes will be the ones accessible to the growth. 
The dashed lines link either a pair of nodes that already be- 
longing to one of the clusters or a pair of nodes that are not 
in any cluster yet. At this time-step these lines do not par- 
ticipate in the growth process. The thick lines link one of the 
seeds to an accessible node, and any of the thick lines can be 
chosen with the same probability to channel the growth of a 
cluster. In this way, the growth rate of a particular cluster 
is proportional to its perimeter, that is, the number of con- 
nections from one of the nodes already incorporated into the 
cluster to an accessible node. In the second step (b) a new 
node is incorporated the cluster of squares and new thick lines 
are added to the perimeter of this cluster. The process con- 
tinues until every one of the nodes have been incorporated to 
one of the clusters (c). At this point the size of all the clusters 
is computed. This mechanism of cluster growth is a greedy 
process. The larger the perimeter of a cluster faster it will 
grow incorporating more nodes and more connections. 



seeds and have not yet been assigned to any other clus- 
ter. Once a node is incorporated to a cluster it will stay 
in this cluster until the end of the process, and only the 
nodes not belonging to any of the existing clusters are 
accessible to the growth process. We will refer to these 
as accessible nodes. The growth process takes place in 
discrete steps. At each step, we randomly select a pair of 
connected nodes, one belonging to a cluster and the other 
that is accessible to growth. The accessible node is subse- 
quently incorporated to the same cluster as its neighbor. 
In this way, the average rate of growth of a cluster is pro- 
portional to the number of accessible nodes in its perime- 
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FIG. 2: Normalized cluster size distributions obtained for 
the competitive growth model under a random ER network 
topology. These distributions were obtained for networks of 
10^ nodes. We performed a larger number of runs for the 
cases with less competing clusters in order to have 10^ clus- 
ter samples for each case. The distributions obtained un- 
der this topology follow approximately an exponential dis- 
tribution with a characteristic cluster size depending on the 
number Us of competing clusters. The dashed lines are the 
functions nae'""". The continuous lines are a fit for the 
function P{n) = Cnl 'n~^ exp(— nan), with the parameters 
C = 0.85 ± 0.03, and 7 = 0.2 ± 0.05 obtained for the col- 
lapsed data shown in the inset. This figure shows that for the 
random networks the form of the distribution is completely 
determined by the density of nodes chosen to be seeds for the 
growth process. 



ter. The process continues until all the nodes have been 
incorporated to one of the competing clusters and no ac- 
cessible node is left on the network. Figure ^ presents 
a pictorial description of our growth model. We suggest 
that this mechanism can be representative of some sort 
of greedy process where large clusters, with many nodes, 
will have more connections to accessible nodes and there- 
fore tend to grow faster and increase even more their 
perimeters. In this way, our model resembles the pref- 
erential attachment model for network growth 41. How- 
ever, the competitive growth has significant differences 
from that mode, in the sense that all seeds are present in 
the beginning of the process and also the fact that some 
clusters will start to grow before others, resulting in a 
variety of cluster sizes. 

The choice of a particular topology for the network of 
contacts should affect the growth process. The simplest 
model for a network is probably the random network 
model proposed by Erdos and Reny (ER) 0. In this 
model, any pair of nodes can be connected with prob- 
ability p, and the degree distribution follows approxi- 
mately a Poisson form with the average degree given by 
k = p{N — 1). In Fig. |21 we show the distribution of 
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FIG. 3: Normalized cluster size distributions obtained when 
the growth process is performed in networks built with the 
preferential attachment schema. We used networks with 10^ 
nodes and obtained 10^ cluster samples. Different from the 
the case of random networks, here we can not produce a data 
collapse for the distributions. We note, however, that for any 
value of ris we have a broad distribution with clusters of all 
sizes ranging from just a few nodes to most of the network. 
The continuous line is a power-law fit, P{n) ~ i"", with 
the exponent a — 1.2 obtained for the scaling region of the 
distribution for ris = 10^. The slope of the becomes more 
steep as the number of competing clusters in the network 
grows. 



the fractions of the network corresponding to each clus- 
ter P{n) when the spreading process takes place on ER 
networks. In this case the cluster size distributions are 
power-laws with a small exponent bounded by an expo- 
nential cutoff with a characteristic scale depending only 
on the density of competing clusters in the network. 

Next, we study the cluster growth process on a scale- 
free network. To build the scale-free network we use 
the so-called preferential attachment method p]. Dif- 
ferent from the random graph, the degree distribution 
of the networks built with this model have a power- 
law form, Pk{k) '~ fc~^, with an exponent 7 = 3, fol- 
lowed by an exponential cutoff at a maximum degree, 
kmax fill- Note that, a cluster that incorpo- 

rates one of the most connected nodes in the beginning 
of the growth process will increase its growth rate by 
a large factor. Thus, the long tail of this distribution 
may have a dramatic effect on the cluster growth pro- 
cess. Indeed, the cluster size distribution obtained when 
the spreading process is done in the preferential attach- 
ment network also shows a power-law tail, p{n) ^ n~°', 
with an exponent a = 1.2 for the case with — 10^ plf . 
as seen in Fig. 13 Interestingly, the competing growth dy- 
namics magnifies the effect of the scale-free distribution 
producing distributions of cluster sizes that have smaller 
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FIG. 4: Normalized cluster size distribution when our model 
is performed on Apollonian networks. The distributions were 
obtained with the 15th generation of the Apollonian network 
what corresponds to A'' = 7174456. Each curve was sampled 
from an ensemble of 10^ clusters. As in the case of the pref- 
erential attachment networks here we observe a power-law 
decay with an exponent a ~ — 1. The continuous line is a 
fit for the scaling region of data with Ua — 100 initial seeds. 
The periodical steps observed in the shape of the distribu- 
tions is remnant of the self similar structure resulting from 
the hierarchical construction of the Apollonian network. 



exponents, and as a consequence, a slower decay than 
that observed for the degree distribution. The scattering 
of the data close to the limit n ^ 1 is due to statisti- 
cal fluctuations at this low frequency limit as well as to 
finite size effects. The implication of these heavy tailed 
distributions is that now the average cluster size is not a 
characteristic scale for the process since one finds, with 
relatively high frequency, clusters that are orders of mag- 
nitude larger than the average. 

Although the preferential attachment model produces 
a degree distribution similar to what is found in several 
real networks, it does not display the self-similarity and 
hierarchical structure also observed in many of those net- 
works. In the work of Ravasz et al. 0, it was suggested 
that the structure of a network could be probed in a 
quantitative way by studying the cluster coefficient of its 
nodes. The cluster coefficient of a node is defined as the 
probability that two of its neighbors taken at random 
are connected. A signature of the hierarchical struc- 
ture would be a cluster coefficient proportional to the 
inverse of the nodes degree c(fc) ~ 0. Both the 
preferential attachment and the random networks do not 
present this property. A model that shows small-world 
behavior, scale-free degree distribution as well as a hi- 
erarchical structure is the recently proposed Apollonian 
network [l9l |. 

In order to test the effect of a hierarchical structure 
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in the cluster growth we implemented our model in the 
topology of the Apollonian network. In Fig. 0] we show 
that the cluster size distributions obtained for the clus- 
ters grown in this topology follow power-law behavior 
with an exponent a = —1. Such behavior can be un- 
derstood with a simple scaling argument. Splitting the 
Apollonian network in the most connected hubs one finds 
three smaller networks corresponding to Apollonian net- 
works of a lower generation. Each of these pieces could 
be split again producing nine smaller networks with this 
hierarchical disassembly continuing down to the level of 
single nodes. After the growth process is performed in a 
large Apollonian network, one could measure the cluster 
size distribution for each of these generations. It should 
be expected that, as one goes to higher generations, the 
distributions approach a limiting form. Note also that 
only the clusters that reach the three hubs in the corner 
of the networks are merged when one moves one genera- 
tion up, and only the size of these few clusters that reach 
these hubs change from one generation to another. Thus, 
this process can change the form of the distribution only 
in the limit of very large clusters, where the frequency 
is of the order of the inverse of the network size. This 
means that the distribution of cluster fractions should 
obey approximately the following similarity relation, 

P{n) = 3F(3n). (1) 

A trivial solution for this similarity relation is the Dirac 
delta We expect the distribution to assume this 

form if, with probability one, the growth process pro- 
duces one giant cluster that incorporates a large frac- 
tion, n « 1, as grows. Besides the delta function, 
any nontrivial function that satisfies the similarity rela- 



tion Q should be of the form C/n. Although we have 
not proved that a giant cluster does not appear, our nu- 
merical results indicate that, at least for the densities of 
seeds we have investigated, this is unlikely. 

Intriguingly, the scaling found for the distributions ob- 
tained with the Apollonian network is similar to those 
found in the distribution of number of votes per candi- 
date in the elections in Brazil . This leads us to suggest 
that the universal behavior observed in the electoral pro- 
cesses may by driven by an underling hierarchical struc- 
ture of the social networks describing the interactions 
among voters. 

In summary, we have introduced a model for compet- 
itive cluster growth in complex networks. By means of 
numerical simulations we have shown that the fraction of 
the network accessed by each cluster follows a character- 
istic distribution that depends on the particular topol- 
ogy of the network. For the case of a random graph we 
found an exponential decay while for scale-free networks 
we found power-law decaying distributions. In the par- 
ticular case of a hierarchical scale-free network, we ob- 
served a decay with a governing exponent a = — 1. The 
distributions we obtained with our model resemble the 
distributions of the fraction of votes per candidates ob- 
served in the proportional elections in Brazil This 
leads us to conjecture that an hierarchical structure may 
be present in the social network governing the process by 
which voters reach their decisions. 
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